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Abstract 

When reasoning in description, modal or temporal logics it is often 
useful to consider axioms representing universal truths in the domain 
of discourse. Reasoning with respect to an arbitrary set of axioms 
is hard, even for relatively inexpressive logics, and it is essential to 
deal with such axioms in an efficient manner if implemented systems 
are to be effective in real applications. This is particularly relevant 
to Description Logics, where subsumption reasoning with respect to a 
terminology is a fundamental problem. Two optimisation techniques 
that have proved to be particularly effective in dealing with termi- 
nologies are lazy unfolding and absorption. In this paper we seek to 
improve our theoretical understanding of these important techniques. 
We define a formal framework that allows the techniques to be pre- 
cisely described, establish conditions under which they can be safely 
applied, and prove that, provided these conditions are respected, sub- 
sumption testing algorithms will still function correctly. These results 
are used to show that the procedures used in the FaCT system arc 
correct and, moreover, to show how efficiency can be significantly im- 
proved, while still retaining the guarantee of correctness, by relaxing 
the safety conditions for absorption. 



*This paper appeared in the Proceedings of the Seventh International Conference on 
Priciples of Knowledge Representation and Reasoning (KR'2000). 
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1 MOTIVATION 



Description Logics (DLs) form a family of formalisms which have grown out 
of knowledge representation techniques using frames and semantic networks. 
DLs use a class based paradigm, describing the domain of interest in terms of 
concepts (classes) and roles (binary relations) which can be combined using a 
range of operators to form more complex structured concepts |BHH + 91[| . A 
DL terminology typically consists of a set of asserted facts, in particular as- 
serted subsumption (is-a-kind-of) relationships between (possibly complex) 
concepts. []. 

One of the distinguishing characteristics of DLs is a formally defined 
semantics which allows the structured objects they describe to be reasoned 
with. Of particular interest is the computation of implied subsumption 
relationships between concepts, based on the assertions in the terminology, 
and the maintenance of a concept hierarchy (partial ordering) based on the 
subsumption relationship [ WS92 |. 

The problem of computing concept subsumption relationships has been 
the subject of much research, and sound and complete algorithms are now 
known for a wide range of DLs (for example flHN9C| , pH9l] , |Baa91| , |DMar| , 
HST99| ). However, in spite of the fundamental importance of terminolo- 
gies in DLs, most of these algorithms deal only with the problem of decid- 
ing subsumption between two concepts (or, equivalently, concept satisfiabil- 
ity), without reference to a terminology (but see |BDS93 , Cal9C, DDM9C, 
|HST99| |). By restricting the kinds of assertion that can appear in a termi- 
nology, concepts can be syntactically expanded so as to explicitly include all 
relevant terminological information. This procedure, called unfolding, has 
mostly been applied to less expressive DLs. With more expressive DLs, in 
particular those supporting universal roles, it is often possible to encapsulate 
an arbitrary terminology in a single concept. This technique can be used 
with satisfiability testing to ensure that the result is valid with respect to 
the assertions in the terminology, a procedure called internalisation. 

Although the above mentioned techniques suffice to demonstrate the 
theoretical adequacy of satisfiability decision procedures for terminological 
reasoning, experiments with implementations have shown that, for reasons 
of (lack of) efficiency, they are highly unsatisfactory as a practical method- 
ology for reasoning with DL terminologies. Firstly, experiments with the 
Kris system have shown that integrating unfolding with the (tableaux) sat- 

x DLs can also deal with assertions about individuals, but in this paper we will only be 
concerned with terminological (concept based) reasoning 
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isfiability algorithm (lazy unfolding) leads to a significant improvement in 
performance | BFH + 94}j . More recently, experiments with the FaCT system 
have shown that reasoning becomes hopelessly intractable when internalisa- 



tion is used to deal with larger terminologies | Hor98 |. However, the FaCT 



system has also demonstrated that this problem can be dealt with (at least 
for realistic terminologies) by using a combination of lazy unfolding and 
internalisation, having first manipulated the terminology in order to min- 
imise the number of assertions that must be dealt with by internalisation (a 
technique called absorption). 

It should be noted that, although these techniques were discovered while 
developing DL systems, they are applicable to a whole range of reasoning 
systems, independent of the concrete logic and type of algorithm. As well 
as tableaux based decision procedures, this includes resolution based algo- 
rithms, where the importance of minimising the number of terminological 
sentences has already been noted [ HS99 |, and sequent calculus algorithms, 
where there is a direct correspondence with tableaux algorithms [BFH+9S]. 

In this paper we seek to improve our theoretical understanding of these 
important techniques which has, until now, been very limited. In partic- 
ular we would like to know exactly when and how they can be applied, 
and be sure that the answers we get from the algorithm are still correct. 
This is achieved by defining a formal framework that allows the techniques 
to be precisely described, establishing conditions under which they can be 
safely applied, and proving that, provided these conditions are respected, 
satisfiability algorithms will still function correctly. These results are then 
used to show that the procedures used in the FaCT system are correct^] 
and, moreover, to show how efficiency can be significantly improved, while 
still retaining the guarantee of correctness, by relaxing the safety conditions 
for absorption. Finally, we identify several interesting directions for future 
research, in particular the problem of finding the "best" absorption possible. 



2 PRELIMINARIES 

Firstly, we will establish some basic definitions that clarify what we mean 
by a DL, a terminology (subsequently called a TBox), and subsumption and 
satisfiability with respect to a terminology, . The results in this paper are 
uniformly applicable to a whole range of DLs, as long as some basic criteria 
are met: 



■i 



Previously, the correctness of these procedures had only been demonstrated by a 



relatively ad- hoc argument [Hor97] 
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Definition 2.1 (Description Logic) Let L be a DL based on infinite sets 
of atomic concepts NC and atomic roles NR. We will identify L with the 
sets of its well-formed concepts and require L to be closed under boolean 
operations and sub- concepts. 

An interpretation is a pair 2 = (A 1 , where A 1 is a non-empty 
set, called the domain of 2, and ■ is a function mapping NC to 2 A and 
NR to 2 AlxAl . With each DL L we associate a set Int(L) of admissi- 
ble interpretations for L. Int(L) must be closed under isomorphisms, and, 
for any two interpretations 2 and 2' that agree on NR, it must satisfy 
2 G Int(L) 44> 2' G Int(L). Additionally, we assume that each DL L comes 
with a semantics that allows any interpretation 2 G Int(L) to be extended to 
each concept C G L such that it satisfies the following conditions: 

(11) it maps the boolean combination of concepts to the corresponding boolean 
combination of their interpretations, and 

(12) the interpretation C 1 of a compound concept C G L depends only on 
the interpretation of those atomic concepts and roles that appear syn- 
tactically in C. 

This definition captures a whole range of DLs, namely, the important 
DL ACC [3S91] and its many extensions. Int(L) hides restrictions on the 



interpretation of certain roles like transitivity, functionality, or role hierar- 
chies, which are imposed by more expressive DLs (e.g., |HST99| ), as these 
are irrelevant for our purposes. In these cases, Int(L) will only contain those 
interpretations which interpret the roles as required by the semantics of the 
logic, e.g., features by partial functions or transitively closed roles by tran- 
sitive relations. Please note that various modal logics [Sch91|, propositional 



dynamic logics [DL94] and temporal logics [EH85] also fit into this frame- 
work. We will use C — > D as an abbreviation for ->C U D, C <-> D as an 
abbreviation for (C — > D) n (D — > C), and T as a tautological concept, e.g., 
A U —iA for an arbitrary A € NC. 

A TBox consists of a set of axioms asserting subsumption or equality 
relations between (possibly complex) concepts. 

Definition 2.2 (TBox, Satisfiability) A TBox T for I is a finite set of 
axioms of the form C\ C C2 or C\ = C%, where C% G L. //, for some A € NC, 
T contains one or more axioms of the form A C C or A = C, then we say 
that A is defined in T. 

Let L be a DL and T a TBox. An interpretation 2 G I int(L) is a model 
of T iff, for each C\ C C2 G T , Cf C holds, and, for each C\ = C2 G T , 
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C\ = C2 holds. In this case we write X \= T . A concept C G L is satisfiable 
with respect to a TBox X iff there is an X G Int(L) with X \= X and C 1 7^ 0. 
A concept C G L subsumes a concept D G L w.r.t. X iff, for all X € I nt(L) 
with X f= X, C 1 D D x holds. 

Two TBoxes X ,X' are called equivalent (X = X'), iff, for allX G Int(L), 
X \=X\ffX \= X'. 

We will only deal with concept satisfiability as concept subsumption 
can be reduced to it for DLs that are closed under boolean operations: C 
subsumes D w.r.t. X iff (D n —>C) is not satisfiable w.r.t. X. 

For temporal or modal logics, satisfiability with respect to a set of formu- 
lae {Cij . . . , Ck} asserted to be universally true corresponds to satisfiability 
w.r.t. the TBox {T = Ci, . . . , T = C n }. 

Many decision procedures for DLs base their judgement on the exis- 
tence of models or pseudo- models for concepts. A central role in these 
algorithms is played by a structure that we will call a witness in this 
paper. It generalises the notions of tableaux that appear in DL tableau- 



algorithms [HNS90, BBH96, HST9E] as well as the Hintikka- structures that 
are used in tableau and automata-based decision procedures for temporal 
logic |EH85| and propositional dynamic logic [|VW86f . 



Definition 2.3 (Witness) Let L be a DL and C £ L a concept. A witness 
W = (A w ,- w ,jC w ) for C consists of a non-empty set A w , a function - w 
that maps NR to 2 A xA , and a function £ w that maps A w to 2 L such 
that the following properties are satisfied: 

(Wl) there is some x 6 A w with C G C w (x), 

(W2) there is an interpretation X G Int(L) that stems from W, and 

(W3) for each interpretation X G I nt(L) that stems from W, it holds that 
D G implies x G D 1 . 

An interpretation X = (A 1 , - 1 ) is said to stem from W if it satisfies: 

1. A 1 = A w , 

2. - j |nr = - W , and 

3. for each A G NC, A G C w {x) =4> x G A 1 and -^A G £ w (x) => x £ 
A 1 . 

A witness W is called admissible with respect to a TBox X if there is an 
interpretation X G I nt(L) that stems from W with X \= X. 
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Please note that, for any witness W, (W2) together with Condition 3 of 
"stemming" implies that, there exists no x G A w and A G NC, such that 
{A, ~<A} C C w (x). Also note that, in general, more than one interpretation 
may stem from a witness. This is the case if, for an atomic concept A G 
NC and an element x G A w , C w (x) n {A, -^A} = holds (because two 
interpretations X and X', with x G A T and x G ~^A X , could both stem from 
W). 

Obviously, each interpretation X gives rise to a special witness, called 
the canonical witness: 

Definition 2.4 (Canonical Witness) Let L be a DL. For any interpreta- 
tion X G Int(L) we define the canonical witness Wx = (A Wl , - Wl , £ Wl ) as 
follows: 

• = • |nr 

£ Wi = Ax.{D G L | x G Z? J } 

The following elementary properties of a canonical witness will be useful 
in our considerations. 

Lemma 2.5 Let L be a DL, C G L, and X a X5oa;. For eac/i X G Int(L) 
with C x + 0, 

L eac/i interpretation X' stemming from Wx is isomorphic to X 

2. VVr is a witness for C , 

3. Wx is admissible w.r.t. X iffI\=T 
Proof. 

1. Let X' stem from Wj. This implies A 1 ' = A 1 and -^'Inr = '^Inr- 
For each x G A 1 and A G NC, {A, ^A} n C Wx {x) / 0, this implies 
' X Inc = '"^Inc an< i hence X and X' are isomorphic. 

2. Properties (Wl) and (W2) hold by construction. Obviously, X stems 
from Wx and from (1) it follows that each interpretation X' stemming 
from Wx is isomorphic to X, hence (W3) holds. 

3. Since X stems from Wx, X |= X implies that Wx is admissible w.r.t. 
X. If Wi is admissible w.r.t. X, then there is an interpretation X' 
stemming from Wx with X' ^= X. Since X is isomorphic to X', this 
implies X |= X. ■ 
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As a corollary we get that the existence of admissible witnesses is closely 
related to the satisfiability of concepts w.r.t. TBoxes: 



Lemma 2.6 Let L be a DL. A concept C G L is satisfiable w.r.t. a TBox T 
iff it has a witness that is admissible w.r.t. T. 

Proof. For the only indirection let Z G I nt(L) be an interpretation with 
Z \= T and C 1 ^ 0. From Lemma 2.5 it follows that the canonical witness 



Wj is a witness for C that is admissible w.r.t. T . 

For the ^/-direction let W be an witness for C that is admissible w.r.t. 
T. This implies that there is an interpretation Z G I nt(L) stemming from 
W with Z \= T. For each interpretation Z that stems from W, it holds that 
C x ^ due to (Wl) and (W3). ■ 

From this it follows that one can test the satisfiability of a concept w.r.t. 
to a TBox by checking for the existence of an admissible witness. We call 
algorithms that utilise this approach model-building algorithms. 

This notion captures tableau-based decision procedures, [HNS9G, BBH96| , 
HST991 , those using automata-theoretic approaches fVW86| , |CDL99 ] and, 
due to their direct correspondence with tableaux algorithms [ HS99| , BFH + 9£], 
even resolution based and sequent calculus algorithms. 

The way many decision procedures for DLs deal with TBoxes exploits 
the following simple lemma. 



Lemma 2.7 Let L be a DL, C S L a concept, and T a TBox. Let W be a 
witness for C. If 



ci = c 2 € r => 

then W is admissible w.r.t. T . 



\tx e A w .(Ci - C 2 e C w (x)) 
Vx G A w .(d ^ C 2 G £ w (x)) 



Proof. W is a witness, hence there is an interpretation Z G I nt(L) stemming 
from W. From (W3) and the fact that W satisfies the properties stated in 



2.7 it follows that, for each x G A x , 
Ci = C 2 G T 



C 2 G £ w (x) 
x 



c 2 y 



Ci 

x G (Ci 

Ci «-► C 2 G £ w (x) 
x G (C% <-> C 2 ) 3 



Hence, X |= T and W is admissible w.r.t. T. ■ 
Examples of algorithms that exploit this lemma to deal with axioms 
can be found in [[DDM96| , pL96| , pST99fl , where, for each axiom C x C C 2 
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(Ci = C2) the concept C\ — > C2 {C\ *-* C%) is added to every node of the 
generated tableau. 

Dealing with general axioms in this manner is costly due to the high 
degree of nondeterminism introduced. This can best be understood by look- 
ing at tableaux algorithms, which try to build witnesses in an incremental 
fashion. For a concept C to be tested for satisfiability, they start with 
A w = {x }, C w (x ) = {C} and - W (R) = for each R E NR. Subsequently, 
the concepts in £ w are decomposed and, if necessary, new nodes are added 
to A w , until either W is a witness for C, or an obvious contradiction of 
the form {A, ~>A} C which violates (W2), is generated. In the latter 

case, backtracking search is used to explore alternative non-deterministic de- 
compositions (e.g., of disjunctions), one of which could lead to the discovery 
of a witness. 

When applying Lemma |2.7| , disjunctions are added to the label of each 
node of the tableau for each general axiom in the TBox (one disjunction for 
axioms of the form C\ C C2, two for axioms of the form C\ = C2). This 
leads to an exponential increase in the search space as the number of nodes 
and axioms increases. For example, with 10 nodes and a TBox containing 10 
general axioms (of the form C\ C C2) there are already 100 disjunctions, and 
they can be non-deterministically decomposed in 2 100 different ways. For 
a TBox containing large numbers of general axioms (there are 1,214 in the 
Galen medical terminology KB RNG93fl ), this can degrade performance 



to the extent that subsumption testing is effectively non-terminating. To 
reason with this kind of TBox we must find a more efficient way to deal 
with axioms. 

3 ABSORPTIONS 

We start our considerations with an analysis of a technique that can be used 
to deal more efficiently with so-called primitive or acyclic TBoxes. 

Definition 3.1 (Absorption) Let L be a DL and T a TBox. An absorp- 
tion of T is a pair of TBoxes (T u , T g ) such that T = T u L)T g and T u contains 
only axioms of the form i C D and —>A C D where A G NC. 

An absorption (T u , T g ) of T is called correct if it satisfies the following 
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condition. For each witness W, if, for each x G A w , 

AQDeT u AA£ C w {x) => D G C w (x) 
-i A Q D £ T u A -i A G C w (x) D G £ w (x) 

Ci E C2 G T 9 =► Ci C 2 G C w (x) 
Ci = C 2 G T g Ci ~ C 2 G £ w (x) 

i/ien W is admissible w.r.t. T. We refer to this properties by (*). A witness 
that satisfies (*) will be called unfolded w.r.t. T . 

If the reference to a specific TBox is clear from the context, we will often 
leave the TBox implicit and say that a witness is unfolded. 

How does a correct absorption enable an algorithm to deal with axioms 
more efficiently? This is best described by returning to tableaux algorithms. 
Instead of dealing with axioms as previously described, which may lead to an 
exponential increase in the search space, axioms in T u can now be dealt with 
in a deterministic manner. Assume, for example, that we have to handle 
the axiom A = C. If the label of a node already contains A (resp. —>A), 
then C (resp. —>C) is added to the label; if the label contains neither A nor 
—>A, then nothing has to be done. Dealing with the axioms in T u this way 
avoids the necessity for additional non-deterministic choices and leads to a 
gain in efficiency. A witness produced in this manner will be unfolded and 
is a certificate for satisfiability w.r.t. T. This technique is generally known 



as lazy unfolding of primitive TBoxes | Hor98 |; formally, it is justified by the 
following lemma: 

Lemma 3.2 Let (T u ,T g ) be a correct absorption of T . For any C G L ; C 

has a witness that is admissible w. r. t. T iff C has an unfolded witness. 

Proof. The indirection follows from the definition of "correct absorption". 
For the only ^/-direction, let C G L be a concept and W a witness for C 
that is admissible w.r.t. T. This implies the existence of an interpretation 
X G Int(L) stemming from W such that I \=T and C 1 7^ 0. Since T = T u UT g 
we have X \= T u U T g and hence the canonical witness Wj is an unfolded 
witness for C. m 
A family of TBoxes where absorption can successfully be applied are 
primitive TBoxes, the most simple form of TBox usually studied in the 
literature. 

Definition 3.3 (Primitive TBox) A TBox T is called primitive iff it 
consists entirely of axioms of the form A = D with A G NC, each A G NC ap- 
pears as at most one left-hand side of an axiom, and T is acyclic. Acyclicity 
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is defined as follows: A G NC is said to directly use B G NC if A = D G T 
and B occurs in D; uses is the transitive closure of "directly uses". We say 
that T is acyclic if there is no A G NC that uses itself. 

For primitive TBoxes a correct absorption can easily be given. 
Theorem 3.4 Let T be a primitive TBox, T g = %, and T u defined by 

T u = {AQ D,^AQ ->D \ A = D e T}. 
Then (T u ,T g ) is a correct absorption of T . 

Proof. Trivially, T = T u U T g holds. Given an unfolded witness W, we have 
to show that there is an interpretation Z stemming from W with X \= T. 

We fix an arbitrary linearisation A± , . . . , A\. of the "uses" partial order 
on the atomic concept names appearing on the left-hand sides of axioms in 
T such that, if A4 uses Aj, then j < i and the defining concept for Ai is Dj. 

For some interpretation Z, atomic concept A, and set X C A 1 , we denote 
the interpretation that maps A to X and agrees with I on all other atomic 
concepts and roles by I[A 1— > X]. For < i < k, we define Zj in an iterative 
process starting from an arbitrary interpretation 1q stemming from W and 
setting 

T % := T;_i[A ^ {x G A w | x G Df- 1 }] 

Since, for each Ai there is exactly one axiom in T, each step in this 
process is well-defined. Also, since Int(L) may only restrict the interpretation 
of atomic roles, 2j £ Int(L) for each < i < k. For I = 2^ it can be shown 
that X is an interpretation stemming from VV with X \=T. 

First we prove inductively that, for < i < k, Zj stems from W. We 
have already required Iq to stem from W. 

Assume the claim was proved for Zj_i and Zj does not stem from W. 
Then there must be some x G A w such that either (i) Ai G but 
x or (ii) ^Ai G but x G A^ 1 (since we assume Zj_i to stem 

from W and is the only atomic concept whose interpretation changes 
from Zj_i to Zj). The two cases can be handled dually: 

(i) From Ai G C w (x) it follows that Di G C w (x), because W is unfolded. 
Since Zj_i stems from W and W is a witness, Property (W3) implies 
x G D,^ 1 . But this implies x G which is a contradiction. 
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(ii) From -<Ai € C w (x) it follows that ->D; L G £ w (x) because W is un- 
folded. Since Zj_i stems from W and W is an witness, Property (W3) 
implies x 6 (-.A) 2 * -1 - Since (-.A) 1 *" 1 = A w \ L>f i_1 this implies 
x Aj*, which is a contradiction. 

Together this implies that Zj also stems from W. 

To show that Z \= T we show inductively that Zj |= Aj = Dj for each 
1 < J < This is obviously true for i = 0. 

The interpretation of D{ may not depend on the interpretation of Aj 
because otherwise (12) would imply that Aj uses itself. Hence D i i = D i 1-1 
and, by construction, Zj |= Aj = Z>j- Assume there is some j < % such that 
Zj ^= Aj = Dj. Since li—i (= = Dj and only the interpretation of Ai has 
changed from Zj_i to Zj, Z)J* 7^ D^ 1 must hold because of (12). But this 
implies that Ai occurs in Dj and hence Aj uses Ai which contradicts j < i. 
Thus, we have I \= Aj = Dj for each 1 < j < k and hence Z |= Z. ■ 

Lazy unfolding is a well-known and widely used technique for optimising 



reasoning w.r.t. primitive TBoxes | BFH + 94 |. So far, we have only given a 



correctness proof for this relatively simple approach, although one that is 
independent of a specific DL or reasoning algorithm. With the next lemma 
we show how we can extend correct absorptions and hence how lazy unfold- 
ing can be applied to a broader class of TBoxes. A further enhancement of 
the technique is presented in Section [5|. 

Lemma 3.5 Let (T u ,T g ) be a correct absorption of a TBox Z. 

1. If T is an arbitrary TBox, then (T u ,T g U Z') is a correct absorption 
of TUT'. 

2. If T is a TBox that consists entirely of axioms of the form A C D, 
where A € NC and A is not defined in T u , then (T U UT' , T g ) is a correct 
absorption of T U Z' . 

Proof. In both cases, T u U T g U T' = T U T' holds trivially. 

1. Let C G L be a concept and W be an unfolded witness for C w.r.t. the 
absorption (T u ,T g U T'). This implies that W is unfolded w.r.t. the 
(smaller) absorption (T u ,T g ). Since (T u ,T g ) is a correct absorption, 
there is an interpretation Z stemming from W with Z |= Z. Assume 
Z y= T . Then, without loss of generality,^] there is an axiom D C 
E S T such that there exists an x S D 1 \ E 1 . Since W is unfolded, 



3 Arbitrary TBoxes can be expressed using only axioms of the form CCD. 
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we have D -> E G C w (x) and hence (W3) implies x G (-iD U £) x = 
A I \(D I \E I ), a contradiction. Hence X |= 7~U7~' and W is admissible 
w.r.t. 7" Uf. 

2. Let C G L be a concept and W be an unfolded witness for C w.r.t. the 
absorption (7^ L)2',2 g ). From W we define a new witness W' for C 
by setting A w := A w , - w := - w , and definig £ w to be the function 
that, for every x G A w , maps x to the set 

£ w (x) U{^A\A^De2',A£ £ w (x)} 

It is easy to see that W' is indeed a witness for C and that W' is also 
unfolded w.r.t. the absorption (2 U U T',T g ). This implies that W' is 
also unfolded w.r.t. the (smaller) absorption (2 u ,2 g ). Since (2 u ,2 g ) is 
a correct absorption of 7", there exists an interpretation 2 stemming 
from W' such that 7 |= 7". We will show that 2 \= T' also holds. 
Assume 2^2', then there is an axiom A C Z) G 7"' and an x G A 1 
such that x G A 1 but x D 1 . By construction of W', x G 
implies A G £ w (x) because otherwise -<A G >C W (x) would hold in 
contradiction to (W3). Then, since W is unfolded, D G £ w '(x), 
which, again by (W3), implies x G D T , a contradiction. 

Hence, we have shown that there exists an interpretation 2 stemming 
from W' such that 2 \= 2 U U 2' U 2 g . By construction of W', any 
interpretation stemming from W' also stems from W, hence W is ad- 
missible w.r.t. TUT'. ■ 

4 APPLICATION TO FaCT 

In the preceeding section we have defined correct absorptions and discussed 
how they can be exploited in order to optimise satisfiability procedures. 
However, we have said nothing about the problem of how to find an ab- 
sorption given an arbitrary terminology. In this section we will describe 
the absorption algorithm used by FaCT and prove that it generates correct 
absorptions. 

Given a TBox 2 containing arbitrary axioms, the absorption algorithm 
used by FaCT constructs a triple of TBoxes (T g , 2 prim , 7i nc ) such that 

• 2 = 2 g U 2p T [ in U 7i nc , 

• 7pri m is primitive, and 
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T mc consists only of axioms of the form A C D where A G NC and A 
is not defined in 7p rim . 



We refer to these properties by (*). From Theorem 3.4 together with 
Lemma |3.5| it follows that, for 

T u := {AQ D, ^A C ->D \ A = D £ T prim } U T inc 

(T u ,T g ) is a correct absorption of T; hence satisfiability for a concept C 
w.r.t. 7" can be decided by checking for an unfolded witness for C. 

In a first step, FaCT distributes axioms from 7" amongst 7I nc , T pT i m , and 
T g , trying to minimise the number of axioms in T g while still maintaining 
(*). To do this, it initialises 7p rim) % nc , and T g with 0, and then processes 
each axiom X G T as follows. 

1. If X is of the form A C C, then 

(a) if A G NC and A is not defined in T pr - ira then X is added to T mc , 

(b) otherwise X is added to T g 

2. If X is of the form A = C, then 

(a) if A G NC, A is not defined in T pr - ira or 71 nc and 7^ rim U {X} is 
primitive, then X is added to ^rim; 

(b) otherwise, the axioms A IZ C and C ^ A are added to 7^. 

3. If X is of the form C Q D, then add X to T g 

4. If X is of the form C = D, then add C C D, D C C to 71. 

It is easy to see that the resulting TBoxes 7^, 7p r i m , T\ nc satisfy (*). In a 
second step, FaCT processes the axioms in T g one at a time, trying to absorb 
them into axioms in 7i nc . Those axioms that are not absorbed remain in T g . 
To give a simpler formulation of the algorithm, each axiom (C C D) G 7^ is 
viewed as a clause G = {D, ^C}, corresponding to the axiom T C C -> D, 
which is equivalent to C C D. For each such axiom FaCT applies the 
following absorption procedure. 

1. Try to absorb G. If there is a concept —>A G G such that A G NC 
and A is not defined in 7^ r i m , then add A C 1? to 7; nc , where 5 is the 
disjunction of all the concepts in G \ {"^j, remove G from T g , and 
exit. 

2. Try to simplify G. 
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(a) If there is some —>C G G such that C is of the form C\ n . . . n C n , 
then substitute —>C with —>C% U . . . U ->C n , and continue with 
step 



2b 



(b) If there is some C G G such that C is of the form (C\ U . . . U C n ), 
then apply associativity by setting G = GU{Ci, . . . , C n }\{(Ci U 
. . . U C n )}, and return to step |l|. 

3. Try to unfold G. If, for some A G G (resp. —>A G G), there is an 
axiom A = C in T pv \ m , then substitute A G G (resp. ->A G G) with C 
(resp. -iC) and return to step |l[ 

4. If none of the above were possible, then absorption of G has failed. 
Leave G in T g , and exit. 

For each step, we have to show that (*) is maintained. Dealing with 
clauses instead of axioms causes no problems. In the first step, axioms are 
moved from T g to % nc as long as this does not violate (*). The second and 
the third step replace a clause by an equivalent one and hence do not violate 
(*). 

Termination of the procedure is obvious. Each axiom is considered only 
once and, for a given axiom, simplification and unfolding can only be applied 
finitely often before the procedure is exited, either by absorbing the axiom 
into 7i nc or leaving it in T g . For simplification, this is obvious; for unfolding, 
this holds because T prim is acyclic. Hence, we get the following: 

Theorem 4.1 For any TBoxT, FaCT computes a correct absorption ofT. 



5 IMPROVING PERFORMANCE 

The absorption algorithm employed by FaCT already leads to a dramatic 
improvement in performance. This is illustrated by Figure ffl, which shows 
the times taken by FaCT to classify versions of the Galen KB with some or 
all of the general axioms removed. Without absorption, classification time 
increased rapidly with the number of general axioms, and exceeded 10,000s 
with only 25 general axioms in the KB; with absorption, only 160s was taken 
to classify the KB with all 1,214 general axioms. 

However, there is still considerable scope for further gains. In particular, 
the following definition for a stratified TBox allows lazy unfolding to be more 
generally applied, while still allowing for correct absorptions. 
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Figure 1: Classification times with and without absorption 

Definition 5.1 (Stratified TBox) A TBox T is called stratified iff it 
consists entirely of axioms of the form A = D with i G NC, each A £ NC 
appears at most once on the left-hand side of an axiom, and T can be ar- 
ranged monotonously, i.e., there is a disjoint partition T\ U 7-j U . . . U of 
T , such that 

• for all 1 < j < i < k, if A € NC is defined in %, then it does not occur 
in Tj, and 

• for all 1 < i < k, all concepts which appear on the right-hand side of 
axioms in % are monotone in all atomic concepts defined in %. 

A concept C is monotone in an atomic concept A if, for any interpretation 
X G Int(L) and any two sets X\,X2 C A 1 , 

For many DLs, a sufficient condition for monotonicity is syntactic mono- 
tonicity, i.e., a concept C is syntactically monotone in some atomic concept 
A if A does no appear in C in the scope of an odd number of negations. 

Obviously, due to its acyclicity, every primitive TBox is also stratified 



and hence the following theorem is a strict generalisation of Theorem 3.4 
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Theorem 5.2 Let T be a stratified TBox, T g = and T u defined by 

T u = {AQ D, -iA C ->D \ A = D e T). 

Then (7~ u ,T g ) is a correct absorption of T . 

The proof of this theorem follows the same line as the proof of Theo- 



rem 3.4. Starting from an arbitrary interpretation Zq stemming from the 
unfolded witness, we incrementally construct interpretations Zi, . . . ,2^, us- 
ing a fixed point construction in each step. We show that each 2j stems 
from W and that, for 1 < j < i < k, Xj \=Tj, hence Z^\= T and stems from 
W. 

Before we prove this theorem, we recall some basics of lattice theory. For 
any set <S, the powerset of S, denoted by 2 s forms a complete lattice, where 
the ordering, join and meet operations are set-inclusion C, union U, and 
intersection n, respectively. For any complete lattice its n-fold cartesian 
product C n is also a complete lattice, with ordering, join, and meet defined 
in a pointwise manner. 

For a lattice £., a function $ : C — > C is called monotone, iff, for x\, Xi G 
£, x\ C X2 implies <&{xi) C <&{x2)- 



By Tarski's fixed point theorem [Tar55], every monotone function $ on 



a complete lattice, has uniquely defined least and greatest fixed points, i.e., 
there are elements x, x S C such that 

x = $>(x) and x = $>(x) 

and, for all x G C with x = <&(x), 

x C x and x C x. 

Proo/ o/ Theorem ||J. T u U T g = T is obvious. Let W = (A w , • w ,/: vv ) 
be an unfolded witness. We have to show that there is an interpretation I 
stemming from W with I \= T . Let Ti, . . . , be the required partition of 
T. We will define Z inductively, starting with an arbitrary interpretation Iq 
stemming from W. 

Assume was already defined. We define Tj from Zj_i as follows: let 
= D^, . . . ,A l m = D^} be an enumeration of %. First we need some 
auxiliary notation: for any concept C € L we define 

C W := {x G A w | C G £ W (x)}. 
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Using this notation we define the function $ mapping subsets X\ , . . . , X m 
of A w to 

( ((A\) w U (Djyzi-itKi,-,*™)) \ (^A\) w , 
((Air U (£>» m )2i-i(^.-^)) \ (^4jJ>v ) 

where 

Ti-\(X\ , ■ ■ ■ , -X,n) := [A\ ^ X\, . . . , A % m h- > X m ] 

Since all of the are monotone in all of the A l m , $ is a monontone func- 
tion. This implies that $ has a least fixed point, which we denote by 
(X_ 1: . . . ,2Lm)- We use this fixed point to define Xj by 

l i :=l i - 1 [A\^X 1 ,...,A i m ^X m ] 

Claim 1: For each < i < k, Xj stems from W. 

We show this claim by induction on i. We have already required To to 
stem from W. Assume Xj_i stems from W. Since the only thing that changes 
from Xj_i to Xj is the interpretation of the atomic concepts A\, . . . , A^, we 
only have to check that A* G £ w (x) implies x G (A*) Ji and ->Aj G £ w (x) 
implies x (A*) 2 '*. 

By definition of and because {x | A) G £ w (x)}n{x | -.Aj G C w (x)} = 
0, A* G C w (x) implies x G (A*) Ji . Also by the definition of -iA*- G £ w (x) 
implies x g" (Aj) Ji . Hence, Xj stems from VV. 
Claim 2: For each 1 < j < i < k, li \= Tj. 

We prove this claim by induction over i starting from 0. For i = 0, there 
is nothing to prove. Assume the claim would hold for Xj_i. The only thing 
that changes from Xj_i to Xj is the interpretation of the atomic concepts 
A\, . . . A l m defined in Xj. Since these concepts may not occur in Tj for j < i, 
the interpretation of the concepts in these TBoxes does not change, and 
from Xj_i |= Tj follows Xj |= Tj for 1 < j < i — 1. 

It remains to show that Xj |= %. Let A*- = Dj be an axiom from Xj. 
From the definition of Xj we have 

(A^ = ((A]r U (D^) \ (^4J) W . (1) 

W is unfolded, hence A* G £ w (x) implies D l - G £ w (x) and, since Xj 
stems from W, this implies x G (Dj) Ji , thus 

(A;.) W U (X»i) Xi = (D))^ (2) 
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Furthermore, ->Aj € C w (x) implies ->Dj G C w (x) implies x G (-iD* ) 1 *, thus 

(D^ \ = [D^ (3) 

Taking together ([[]), and (||) we get 

(^) Xl = pjfs 

and hence 2j |= A*- = Dj . 

Together, Claim 1 and Claim 2 prove the theorem, since is an inter- 
pretation that stems from W and satisfies T. ■ 

This theorem makes it possible to apply the same lazy unfolding strategy 
as before to cyclical definitions. Such definitions are quite natural in a 
logic that supports inverse roles. For example, an orthopaedic procedure 
might be defined as a procedure performed by an orthopaedic surgeon, while 
an orthopaedic surgeon might be defined as a surgeon who performs only 
orthopaedic procedures:^ 

o-procedure = procedure n (3 performs^ . o-surgeon) 
o-surgeon = surgeon n (\f performs, o-procedure) 

The absorption algorithm described in Section |3| would force the second 
of these definitions to be added to T g as two general axioms and, although 
both axioms would subsequently be absorbed into T u , the procedure would 
result in a disjunctive term being added to one of the definitions in T u . 



Using Theorem 5.2 to enhance the absorption algorithm so that these kinds 
of definition are directly added to T u reduces the number of disjunctive terms 
in T u and can lead to significant improvements in performance. 

This can be demonstrated by a simple experiment with the new FaCT 



system, which implements the STiZQ logic [HST9E] and is thus able to deal 
with inverse roles. Figure ^ shows the classification time in seconds using 
the normal and enhanced absorption algorithms for terminologies consisting 
of between 5 and 50 pairs of cyclical definitions like those described above 
for o-surgeon and o-procedure. With only 10 pairs the gain in performance is 
already a factor of 30, while for 45 and 50 pairs it has reached several orders 
of magnitude: with the enhanced absorption the terminology is classified in 
2-3 seconds whereas with the original algorithm the time required exceeded 
the 10,000 second limit imposed in the experiment. 

It is worth pointing out that it is by no means trivially true that cyclical 
definitions can be dealt with by lazy unfolding. Even without inverse roles 



'This example is only intended for didactic purposes. 
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Figure 2: Classification times with and without enhanced absorption 

it is clear that definitions such as A = ->A (or more subtle variants) force 
the domain to be empty and would lead to an incorrect absorption if dealt 
with by lazy unfolding. With converse roles it is, for example, possible to 
force the interpretation of a role R to be empty with a definition such as 
A = \/R.(yR~.->A), again leading to an incorrect absorption if dealt with by 
lazy unfolding. 

6 OPTIMAL ABSORPTIONS 

We have demonstrated that absorption is a highly effective and widely appli- 
cable technique, and by formally defining correctness criteria for absorptions 
we have proved that the procedure used by FaCT finds correct absorptions. 
Moreover, by establishing more precise correctness criteria we have demon- 
strated how the effectiveness of this procedure could be further enhanced. 

However, the absorption algorithm used by FaCT is clearly sub-optimal, 
in the sense that changes could be made that would, in general, allow more 
axioms to be absorbed (e.g., by also giving special consideration to axioms 
of the form ->A C C with A S NC). Moreover, the procedure is non- 
deterministic, and, while it is guaranteed to produce a correct absorption, 
its specific result depends on the order of the axioms in the original TBox 
T. Since the semantics of a TBox T does not depend on the order of its 
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axioms, there is no reason to suppose that they will be arranged in a way that 
yields a "good" absorption. Given the effectiveness of absorption, it would 
be desirable to have an algorithm that was guaranteed to find the "best" 
absorption possible for any set of axioms, irrespective of their ordering in 
the TBox. 

Unfortunately, it is not even clear how to define a sensible optimality 
criterion for absorptions. It is obvious that simplistic approaches based on 
the number or size of axioms remaining in T g will not lead to a useful solution 
for this problem. Consider, for example, the cyclical TBox experiment from 
the previous section. Both the original FaCT absorption algorithm and the 



enhanced algorithm, which exploits Theorem |5.2| , are able to compute a 
complete absorption of the axioms ( i.e., a correct absorption with T g = 0), 
but the enhanced algorithm leads to much better performance, as shown in 
Figure |[ 

An important issue for future work is, therefore, the identification of 
a suitable optimality criterion for absorptions, and the development of an 
algorithm that is able to compute absorptions that are optimal with respect 
to this criterion. 
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